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Abstract 

The SWI/SNF-like chromatin-remodeling protein ATRX has emerged as a l<ey factor in the regulation of a-globin gene 
expression, incorporation of histone variants into the chromatin template and, more recently, as a frequently mutated 
gene across a wide spectrum of cancers. Therefore, the availability of a functional ATRX cDNA for expression studies is 
a valuable tool for the scientific community. We have identified two independent transposon insertions of a bacterial 
ISIO element into exon 8 of /\7'RX isoform 2 coding sequence in two different plasmids derived from a single source. 
We demonstrate that these insertion events are common and there is an insertion hotspot within the ATRX cDNA. Such 
ISIO insertions produce a truncated form of ATRX, which significantly compromises its nuclear localization. In turn, we 
describe ways to prevent ISIO insertion during propagation and cloning of ATRX-containing vectors, including optimal 
growth conditions, bacterial strains, and suggested sequencing strategies. Finally, we have generated an insertion-free 
plasmid that is available to the community for expression studies of ATRX. 

Keywords: ATRX; Insertion element; Chromatin remodeling; Cloning vector; ATRX over-expression; ISIO element 



Background 

ATRX is an ATP-dependent chromatin-remodeling heli- 
case of the SWI/SNF family and was originally identified 
because mutations in its coding region give rise to a com- 
plex disorder known as a-Thalassemia, Mental Retardation 
X-linked (ATRX) syndrome. Its role in the a-Thalassemia 
feature of this syndrome is through its regulation of the 
chromatin state of the a-globin genes (Higgs et al. 
2005; Gibbons 2006; Law et al. 2010; Ratnakumar et al. 
2012). However, its role in other ATRX syndrome traits 
remains largely unknown. Mechanistically, ATRX has 
been implicated in heterochromatin maintenance and 
formation, DNA methylation, regulation of gene expres- 
sion, alternative lengthening of telomere suppression and 
histone variant deposition (for review see: Berube 2011; 
Ratnakumar and Bernstein 2013; Clynes et al. 2013). 
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Recently, large-scale genome sequencing efforts have 
identified ATRX mutations and deletions in a range of 
tumor types, including glioblastoma multiforme, neuro- 
blastoma and pancreatic neuroendocrine tumors (Jiao 
et al. 2011; Schwartzentruber et al. 2012; Cheung et al. 
2012; Molenaar et al. 2012; Clynes et al. 2013). These 
alterations appear to result in loss of functional ATRX, 
suggesting that ATRX acts as tumor suppressor. How- 
ever, the mechanisms by which such mutations result 
in ATRX loss-of-function remain unclear. 

For the reasons noted above, ATRX has emerged as an 
important chromatin remodeler to study during normal 
development and disease. However, the cloning and 
overexpression of ATRX cDNA has been challenging due 
to its large size: 2,492 amino acids encoded in an 11,167 
bp cDNA (isoform 1) and 2,454 amino acids encoded in 
an 11,031 bp cDNA (isoform 2). Here, we report two 
independent transposon insertions of an ISIO element 
into exon 8 of the ATRX isoform 2 coding sequence in 
two different plasmids derived from a single original 
source. Our data suggest that ATRX contains at least one 
insertion hot spot that is highly active when the plasmid is 
grown under standard bacterial conditions. Therefore, 
we recommend strategies to prevent ISIO insertions 
during propagation and cloning of ATRX-containing 
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vectors. Furthermore, we analyzed the functional effect of 
this insertion and conclude that ISIO insertions produce 
a truncated form of ATRX that compromises its cellular 
distribution. 

Results and discussion 

We sequenced two different ATRX plasmids that have been 
widely used for overexpression experiments (Additional 
file 1) (Berube et al. 2002, 2008; McFarlane and Preston 
2011; Ratnal<umar et al. 2012; Newhart et al. 2012). One of 
the plasmids contains the full length ATRX cDNA sequence 
in a pEGFP (GFP-ATRX) (Berube et al. 2008) backbone 
and the other is cloned in a lentiviral backbone (ATRX- 
YFP) and was derived from the sequence contained in 
GFP-ATRX (Newhart et al. 2012). We found that the ATRX 
cDNA sequence that has been cloned corresponds to 
isoform 2, which lacks exon 6 in comparison to isoform 
1. Exon 6 codes for 38 highly conserved amino acids. 
The functional relevance, if any, between isoform 1 and 
isoform 2 has yet to be reported. In addition, we found 
that El 24 is missing in ATRX isoform 2. This amino 
acid is the last residue encoded by exon 5. 

To our surprise, we uncovered transposon insertions 
of ISIO elements within the ATRX coding sequence in 
both plasmids analyzed (Figure lA-B). The transposable 
prokaryotic IS family has been reported to be a common 
source of mutations in the Escherichia coli genome 
(Rodriguez et al. 1992). Likewise, it has also been shown 
that ISlO-related sequences may lead to cloning aberra- 
tions and sequence contamination of plasmids hosted in 
E. coli (Rood et al. 1980; Muller et al. 1989; Kovarik 
et al. 2001; Kobori et al. 2009; Umenhoffer et al. 2010). 
Interestingly, the ISIO insertion in both plasmids - which 
we name ISIO-GFP-ATRX and ISIO-ATRX-YFP - was 
found in exon 8. However, sequencing and PGR analyses 
showed that both the insertion site and the direction of 
the ISIO element are different in each plasmid, suggesting 
independent insertion events (Figure 1A,B, Additional 
file 1). The insertion site within ISIO-GFP-ATRX closely 
resembles the 9-bp canonical ISIO insertion sequence 
(Additional file 1, see Kovarik et al. 2001), and may repre- 
sent a hotspot for ISIO transposition. The insertion site 
within ISIO-ATRX-YFP has a more divergent sequence. It 
is interesting to note that ISIO-ATRX-YFP was derived 
from ISIO-GFP-ATRX, however there is no sequence 
evidence of an ISIO excision in ISIO-ATRX-YFP. This 
suggests that the ATRX cDNA of ISIO-GFP-ATRX used 
to derive ISIO-ATRX-YFP was lacking the ISIO inser- 
tion, and that the insertion observed in the latter was 
acquired during the cloning or plasmid expansion 
process. This highlights the dynamic feature of this 
transposable element. 

In order to assess if the expression of the truncated form 
of ATRX derived from the ISIO-GFP-ATRX plasmid may 



retain some of the characteristics of full length ATRX, 
we sub-cloned the ATRX cDNA sequence from the 
ISIO-GFP-ATRX plasmid to generate an 'Insertion-Free' 
vector that we call IF-GFP-ATRX (see details in Methods). 
IF-GFP-ATRX was further analyzed by PGR and restric- 
tion analysis (data not shown) and the ATRX cDNA 
was fully sequenced to ensure that the insertion was 
completely removed (Additional file 1). 

Next we investigated if the insertions are an isolated 
event or a common phenomenon. IF-GFP-ATRX was 
transformed into both DH5a and Stbl4 E. coli strains. 
The plasmids were analyzed by colony PGR with primers 
spanning the ISIO insertion sites found in ISIO-ATRX- 
YFP and ISIO-GFP-ATRX (Additional file 1, see Methods). 
We found that 18% of the DH5a colonies (26 out of 139 
analyzed colonies) present evidence of insertions within 
exon 8. Sequencing and restriction analysis of the plasmid 
containing insertions revealed that the observed change is 
caused by de novo insertions of the ISIO transposable 
element (Figure IC, Additional file 1). Conversely, the 
plasmid derived from the Stbl4 bacteria - a strain spe- 
cifically designed for cloning unstable inserts - showed 
a significant lower insertion rate (3 out of 80 analyzed 
colonies - p-value < 0.01, binomial test). Furthermore, 
we found evidence of other aberrations in the DH5a 
colonies by restriction analyses (6 out of 30 analyzed 
colonies that did not contain the exon 8 ISIO insertion; 
data not shown). This suggests that the ATRX plasmid 
may be subjected to ISIO insertions in other regions of 
the ATRX cDNA or other types of alterations such as 
recombination events. This data strongly suggests that 
insertions within the ATRX coding sequence in E. coli are 
a common event that is influenced by the bacterial strain 
used to amplify ATRX-containing plasmids. Moreover, our 
results suggest that ATRX presents an ISIO insertion hot- 
spot within exon 8. The causes that trigger the insertion of 
the ISIO element within the ATRX cDNA remain unclear. 
However, the homogeneity of the restriction patterns in 
the IF-GFP-ATRX with de novo ISIO insertions (Figure IC) 
suggests that the ISIO transposition is an early clonal 
event. In fact, it has been proposed that the stress induced 
by the transformation process may be a trigger for ISIO 
transposition (Kovarik et al. 2001). 

We next transfected ISIO-GFP-ATRX and IF-GFP- 
ATRX into HEK 293T cells and assessed ATRX expression 
and chromatin association. We found that the transfection 
efficiency was approximately double for ISIO-GFP-ATRX 
(60%) compared to IF-GFP-ATRX (30%), and that the 
sub-cellular localization was dramatically different (data 
not shown. Figure 2A). As expected, IF-GFP-ATRX trans- 
fected cells show a focal nuclear localization of GFP-ATRX 
(Figure 2A) in agreement with previous reports (Xue et al. 
2003; Tang et al. 2004; Ishov et al. 2004; Berube et al. 2008). 
However, we observed a clear cytoplasmic GFP signal in 
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Figure 1 ATRX overexpression plasmlds contain IS10 insertions. (A) Schematic map of the insertions found in IS10-GFP-ATRX and IS10-ATRX- 
YFP. The exon structure (alternate white and gray bars) of ATRX cDNA isoform 2 (top) and some relevant domains from the protein sequence (bottom) 
are shown. The insertion sites are denoted by black bars and their positions in the cDNA and the protein are described in the right box. Black arrows 
show the direction of the IS10 elements found. The stop codons introduced by ISIO in the protein sequence are indicated with asterisks. The position 
of two of the primer pairs used for analyzing ATRX sequence is shown (white and gray arrows). The NLS within ATRX cDN/\ is indicated with a black 
bar. (B) PGR analysis of IS10-GFP-ATRX, IS10-ATRX-YFP and IF-GFP-ATRX using the primers shown in (A). Both amplicons of IF-GFP-ATRX have 
the expected size (1,449 bp for amplicon I and 1,935 for amplicon II) while amplicon I shows an insertion in ISIO-ATRX-YFP and amplicon I 
shows an insertion in IS10-GFP-ATRX (both have additional 1,335 bp). (C) Representative EcoRI digestion patterns of the IF-GFP-ATRX plasmid 
grown in two different bacteria strains. The Stbl4-derived plasmid has the expected pattern while the plasmid derived from DH5a shows an 
insertion in the 2,400bp fragment (shift from a 2,458 bp fragment to -3,800 bp). 



the ISIO-GFP-ATRX transfected cells and a more diffuse 
nuclear localization (Figure 2A). According to our sequence 
analysis, a stop codon is introduced right after the ISIO 
insertion site in both plasmids containing the ISIO element 
(Figure lA). As ATRX is N-terminally GFP-tagged, the 
fluorescence observed in the ISIO-GFP-ATRX transfec- 
tants is likely due to the overexpression of a truncated 
GFP-ATRX fusion that contains only the ADD domain 
(Figure lA). 

Although we cannot exclude the possibility that the 
re-localization of the ISIO-GFP-ATRX is due to an over- 
expression effect, ATRX nuclear localization is dependent 
on nuclear localization sequences (NLS) (Berube et al. 
2008) and our sequence analysis predicts that the ISIO- 
GFP-ATRX lacks the NLS (FigurelA). However, we 
note that although ATRX cytoplasmic localization is 
clear in the ISIO-GFP-ATRX transfected cells, cells still 
do show some nuclear localization (Figure 2A). Because 
the ADD domain alone is capable of recognizing and 
binding the histone H3 tail in vitro by reading the com- 
binatorial methylation state of histone H3 lysines 4 and 
9 (Eustermann et al. 2011; Iwase et al. 2011) a small 
fraction of the expressed protein product may still be 
tethered to chromatin. 



To probe this further and assay the chromatin associ- 
ation of the ATRX protein products, we prepared both 
whole cell and chromatin extracts from the transfected 
HEK 293T cells and performed Western blot analyses. As 
predicted, we found a truncated ATRX product from 
the transfections performed with the ISIO-GFP-ATRX 
plasmid (Figure 2B,C). This truncated product is ~70 
kDa and corresponds to the ATRX ADD domain coupled 
to the N-terminal GFP. This is further supported by the 
fact that the ISIO-GFP-ATRX product is detectable only 
with a GFP antibody and not with an ATRX antibody 
whose epitope recognizes the C-terminal portion of the 
protein (data not shown). 

While we observed chromatin-bound full-length ATRX 
from the IF-GFP-ATRX transfections, we noted that 
the ISIO-GFP-ATRX truncated product also binds to 
the chromatin fraction (Figure 2C). This supports the 
idea that the ADD domain allows binding to chromatin. 
Whether the ISIO-GFP-ATRX product has any effect 
on the chromatin landscape remains to be explored, for 
example, it could act as a dominant negative. However, it 
is interesting to note that in comparison to full-length 
ATRX, the ISIO-GFP-ATRX fragment has a greater 
accumulation in whole cell extracts (Figure 2B) than 
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Figure 2 ISIO-element alters GFP-ATRX localization and incorporation into chromatin. (A) HEK-293T cells were transiently transfected with 
ISIO-GFP-ATRX (left) or IF-GFP-ATRX (right). The localization of GFP-ATRX was determined by confocal fluorescence microscopy 72 hours post-transfection. 
Representative images are shown. Dashed lines on the highlighted cells indicate nuclei area. ISIO-GFP-ATRX displays a diffuse nuclear and cytoplasmic 
localization. Conversely, ATRX-GFP has a nuclear localization in the IF-GFP-ATRX transfected cells with a focal pattern. (B) Western analysis of whole cell 
protein extracts from the ISIO-GFP-ATRX and IF-GFP-ATRX transfections against GFP (top) and actin (bottom). Bands corresponding to IF-GFP-ATRX and 
ISIO-GFP-ATRX products are indicated with arrows. (C) Chromatin extracts from the ISIO-GFP-ATRX and IF-GFP-ATRX transfections were analyzed with 
GFP antibodies (top). Bands corresponding to IF-GFP-ATRX and ISIO-GFP-ATRX products are highlighted with arrows. Amido black staining of histones 
was used as a loading control (bottom). A truncated product is observed in the ISIO-GFP-ATRX transfections. 



in chromatin (Figure 2C). Taken together, these results 
strongly suggest that the ISIO element provokes the 
production of a truncated ATRX form that has a diffuse 
cellular localization, but still retains the capacity to bind to 
chromatin. Importantly, during experimental conditions, if 
one were to perform immunofluorescence of ISIO-GFP- 
ATRX transduced cells without Western blot analysis, one 
might conclude that full length ATRX is expressed. 

Conclusions 

Here we show that two ATRX overexpression plasmids, 
utilized in the ATRX research community, contain ISIO- 
element insertions around a probable hot spot in exon 8 
of ATRX. Further, we demonstrate that ISIO transposition 
events are common if >ir7?X-containing vectors are grown 
in standard bacteria strains. We note here that the 
dynamic nature of the insertions makes it likely that 
any single lab may have batches of contaminated and 



insertion-free plasmid. We further highlight that the 
published studies of ATRX function using these plasmids 
may have indeed been insertion-free, as we have confirmed 
for our own studies (Ratnakumar et al. 2012). However, 
because the ISIO-element has a significant effect on ATRX 
sub-cellular localization and allows its incorporation 
into chromatin, experiments performed with ISIO- 
ATRX-containing plasmids could generate inaccurate 
conclusions. 

It has not escaped our attention that despite all the 
mutations identified in ATRX syndrome patients and, 
more recently, in distinct cancer types (Jiao et al. 2011; 
Schwartzentruber et al. 2012; Cheung et al. 2012; Clynes 
et al. 2013), few ATRX over-expression experiments 
mimicking such mutations have been published. It is our 
belief that to understand the contribution of ATRX 
mutations in disease etiology, we need to develop reliable 
over-expression systems. However, our results show that 
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these experiments are challenging and subjected to tech- 
nical hurdles that the community should be aware of. 

Finally, we have made our IF-GFP-ATRX plasmid avail- 
able through the AddGene database (plasmid 45444). 
Also, we provide technical advice for the propagation 
and cloning of ATRX-containing vectors (see Methods 
section). The sequences obtained from the ISIO-GFP- 
ATRX and ISIO-ATRX-YFP plasmids, as well as the 
primers used for the ATRX sequencing, cloning and 
analysis are included in the Additional file 1. Thus, we 
recommend that each laboratory should take the necessary 
steps to avoid ISIO contamination and test each plasmid 
preparation for ISIO insertion elements. We are hopeful 
that this technical resource will be useful for future 
studies and will help to avoid propagation and usage of 
ISlO-ATRX-containing plasmids. 

Methods 

ATRX-containing plasmid propagation and handling 

The ATRX plasmids were propagated using ElectroMAX™ 
Stbl4™ Competent Cells (Life Technologies, 11635-018). 
Topic (Invitrogen, C404003) and DH5a bacteria were 
also used and are not recommended for growing A77?X- 
containing plasmids. We recommend growing bacteria at 
30°C to avoid plasmid recombination. Using strains lack- 
ing IS-elements such as MDS42 is also recommended. 

ATRX PCR analysis and sequencing 

PCR analyses were performed using Platinum* Pfx DNA 
Polymerase (Invitrogen 11708-013), according to manufac- 
turer's indications. The primers used span across the ATRX 
cDNA and their sequences can be found in Additional 
file 1. ATRX cDNA sequencing was performed using the 
same primers and one additional primer specific for 
exon 8 that can also be found in Additional file 1. We 
recommend analyzing the plasmid by PCR (or restriction 
digest) following culture in E. coli and on a frequent basis 
when working with ATRX-containmg vectors. 

ATRX transformation and growth in different 
bacterial strains 

ElectroMAX'" Stbl4™ competent cells (Life Technologies, 
11635-018) and DH5a competent cells (Life Technologies 
18258-012) were transformed according to manufacturer's 
indications with 50 ng of IF-GFP-ATRX. Cells were plated 
and growth at 37°C. Colonies from three independent 
transfections were picked and diluted in 100|il of PBS. 5 |il 
were used for colony PCR using the ATRX ISIO primers 
from Additional file 1. Colonies that presented evidence of 
insertion, as well as control colonies were minipreped 
using the QIAprep Spin Miniprep Kit (Qiagen, 27106) 2 
|ag of the plasmid were digested with 2 units of EcoRI 
(New England Biolabs, ROIOIS) for 1 hour at 37°C. 



IF-GFP-ATRX construct generation 

ISIO-GFP-ATRX was digested with BamHI and Aflll (NEB) 
to produce a 1200 bp band (Fragment A). A plasmid 
containing internal ATRX sequence was generated using 
GenScript services: (5'-ggtaccTACGTCAGCTTAAGGCT 
TTTAAGTCTGTGTTGGCTGATATTAAGAAGGCTCA 
TCTTGCATTGGAAGAAGACTTAAATTCCGAGTTTC 
GAGCGATGGATGCTGTAAACAAAGAGAAAAATAC 
CAAAGAGCATAAAGTCATAGATGCTAAGTTTGAAA 
CAAAAGCACGAAAAGGAGAAAAACCTTGTGCTTT 
GGAAAAGAAGGATATTTCAAAGTCAGAAGCTAAA 
CTTTCAAGAAAACAGGTAGATAGTGAGCACATGC 
ATCAGAATGTTCCAACAGAGGAACAAAGAACAAA 
TAAAAGTACCGGTGGTGAACATaagctt-3). A restriction 
digestion product was generated using the GenScript 
plasmid and enzymes, Aflll and Agel (Fragment B). A 
restriction digestion product of ISIO-GFP-ATRX was 
generated using enzymes, Agel (NEB) and Xbal (Fragment 
C). Finally, BamHI and Xbal (NEB) were used to digest 
pEGFP-C2 plasmid, generating a backbone for insertion 
of the ATRX fragments. The three ATT^X-containing 
fragments (A, B, and C) were ligated first (1200 bp + 
300 bp + 6000 bp). Then the backbone pEGFP vector was 
added for a second ligation reaction. Ligation product 
was transformed into damVdcm' competent E. coli (NEB 
C2925H). Colonies were mini-prepped and sequenced. 

Transfections 

Transfections were performed in the HEK-293T cell line 
(ATCC: CRL- 11268) using Lipofectamine" 2000 transfec- 
tion reagent according to manufacturer's indications. 14 |ig 
of each plasmid were transfected in 70% confluent 10 cm 
plates. Cells were harvested 48 hours post-transfection for 
chromatin preparation or fixed for microscopy (see below). 

Microscopy 

24 hours post-transfection, cells were seeded onto chamber 
well slides. 48 hours later, cells were fixed with 4% parafor- 
maldehyde/PBS (Sigma Aldrich) at 4°C overnight. Slides 
were stained with 4',6-diamidino-2-phenylindole (DAPI) 
for 10 minutes at room temperature. Images were collected 
with a Zeiss LSM700 confocal microscope. 

Chromatin isolation and western analysis 

Chromatin isolation was performed as described previ- 
ously (Mendez and Stillman 2000) and chromatin samples 
were boiled at 100°C for 10 minutes instead of being soni- 
cated. Western analysis was performed with the following 
antibodies: GFP (Santa Cruz sc-8334, 1:200 or Roche 
1181460001, 1:10,000), Actin (Sigma- Aldrich A1978, 0.75 
[ig/mV). Histones from chromatin extracts were stained 
with amido black staining solution (Sigma A8181-1EA). 
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Additional file 



Additional file 1: Plasmid and primer sequences. This files contains 
the sequences of the IS1 0 insertions, all the primers used for the analyses 
described on the paper, and the full IF-GP-ATRX sequence. 
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